首页> 外文OA文献 >Generating summary documents for a variable-quality PDF document collection
【2h】

Generating summary documents for a variable-quality PDF document collection

机译:为质量可变的PDF文档集生成摘要文档

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The Cochrane Schizophrenia Group’s Register of studies details all aspects of the effects of treating people with schizophrenia. It has been gathered over the last 20 years and consists of around 20,000 documents, overwhelmingly in PDF. Document collections of this sort – on a given theme but gathered from a wide range of sources – will generally have huge variability in the quality of the PDF, particularly with respect to the key property of text searchability.\ud\udSummarising the results from the best of these papers, to allow evidence-based health care decision making, has so far been done by manually creating a summary document, starting from a visual inspection of the relevant PDF file. This labour-intensive process has resulted, to date, in only 4,000 of the papers being summarised – with enormous duplication of effort and with many issues around the validity and reliability of the data extraction.\ud\udThis paper describes a pilot project to provide a computer-assisted framework in which any of the PDF documents could be searched for the occurrence of some 8,000 keywords and key phrases.Once keyword tagging has been completed the framework assists in the generation of a standard summary document, thereby greatly speeding up the production of these summaries. Early examples of the framework are described and its capabilities illustrated.
机译:Cochrane精神分裂症小组的研究记录详细介绍了治疗精神分裂症患者的效果的各个方面。在过去的20年中,它已经被收集起来,包含大约20,000个文档,绝大多数为PDF。此类文档集合(以给定主题为主题,但从各种各样的来源中收集)通常在PDF的质量方面具有巨大的可变性,尤其是在文本可搜索性的关键属性方面。\ ud \ ud迄今为止,这些文件中的最好的文件是通过从相关PDF文件的目视检查开始手动创建摘要文件来完成的,以允许基于证据的医疗保健决策。迄今为止,这种劳动密集型的过程仅汇总了4,000篇论文-付出了巨大的努力,并且围绕数据提取的有效性和可靠性存在许多问题。\ ud \ ud本文介绍了一个试点项目,旨在提供一个计算机辅助框架,可在其中搜索任何PDF文档以查找8,000个左右的关键字和关键词。一旦完成关键字标记,该框架将有助于生成标准摘要文档,从而大大加快了生产速度这些摘要中。描述了该框架的早期示例,并说明了其功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号